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DEFENSE DATA NETWORK (DDN) PERFORMANCE ANALYSIS USING 

PROBABILITY MODELING 



D. P. Gaver 
P. A. Jacobs 

SUMMARY 

The Data Defense Network (DDN) is a large packet switching network 
that services elements of the U.S. Department of Defense (DOD). The 
emphasis of this report is to perform probabilistic analysis of certain features 
of the DDN system with a view of enhancing or "optimizing" measures of 
service such as data base throughput and the waiting times experienced by 
data-base-transfer customers. The particular questions addressed, and the 
models constructed, are in response to tasking statements supplied by 
personnel from the Defense Communications Engineering Center, Reston, 
Virginia. 

In Section 2 models are presented to study the optimal length of a packet 
♦* 

subject to transmission errors. When a data transfer is to occur the total 
collection of bits that comprise the data base is divided into packets, i.e., 
subcollections of contiguous bits from the data base plus a header carrying 
address information. Packets from the source node, S, travel through a 
number of packet switching nodes (PSN's) to the destination node, D. The 
number of PSN's encountered by a packet during its transit from S to D is 
called the number of hops. Each bit in the packet is susceptible to corruption 
by various sources along the way. If any bit in the packet is so corrupted the 



entire packet is viewed as useless and must be retransmitted. A large packet is 
more likely to suffer contamination than is a small one. On the other hand, 
each packet of any size has about the same number of header bits and so small 
packets require the transfer of an uneconomically large amount of header 
information as compared to the "live" bits that are the data to be transferred. 
The question addressed in Section 2 is how to choose the number b of live bits 
per packet when a) the successful transmission probability per bit is p, b) the 
number of header bits per packet is fixed and equal to h and c) the number of 
hops is fixed at J. Simple approximate formulas are obtained for the packet 
length which maximizes the transmission rate of live bits. 

A message is comprised of a number of packets. The Appendix A 
contains a revision of a prior report which introduces an initial model to 
study the effect of the number of packets in a message upon data transfer 
time. The model incorporates the following features. At an initial instant 
(t=0) a group of m>l packets comprising a message is sent from a source S to a 
destination D. The m packets are sent out simultaneously via the network. 
The first to arrive and find space for the m-packet message occupies a one- 
packet space and reserves space for m-1 others and sends an 
acknowledgement back to S. If no acknowledgement is received by S in time 
5 each packet is retransmitted; this action is repeated until acknowledgement 
occurs. Such is necessary because packets may be lost due to encountering full 
buffers along the way, etc. Once the initial reservation is made a time elapses 
until the remaining m-1 packets of the message (originals or duplicates 
thereof) reach D: each subsequent packet experiences delays similar to the 
first, but need not reserve space in D's buffer. The model of the Appendix is 
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used to obtain an expression for the long run rate of transfer of data packets 
from source S. The number of packets per message which maximizes the 
transfer rate is investigated numerically. As noted in the Appendix, while 
model does account for retransmission by the tagged source, the possible effect 
of increased congestion due to retransmission by other sources is not 
explicitly modeled. 

In Section 3 models are introduced to study the effect on D's buffer of all 
sources retransmitting at a retransmission interval of length 5. We also 
model the behavior of one form of congestion control, exponential backoff, a 
procedure that increases successive time-out intervals possibly from 5 to 25, 
25 to 45, etc. The models have as inputs the arrival rate of new or original 
packets to D's buffer, X; the length of a packet in terms of its deterministic 
service time s; the retransmission rate \) = 1/5; and the size of D's buffer, B. 
The models are of the amount of work in D's buffer. For a moderate original 
traffic intensity A.s=0.5 the models indicate the following. For small 
retransmission rate u, the amount of work in D's buffer tends to be small. For 
a large retransmission rate \), the buffer tends to be nearly full. If the 

I* 

retransmission rate i) takes on a range of intermediate values bistability may 
occur; the work in the buffer resides for a while at low values, then switches 
to a much higher value with the buffer nearly full; after a time there is a 
switch again to low values, then back to high values, etc. It is shown that 
exponential backoff can alleviate the unsatisfactory behavior of full buffers 
and bistability. 

Section 3 also contains models for the response time of a tagged packet 
sent to D in the environment of the other packets contending for space in D's 
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buffer also being retransmitted. It is found that there is a best retransmission 
interval 5. Too large a retransmission time 5 means too much time is wasted 
in recovery from a lost packet. Too small a retransmission time 5 leads to a 
filling of D's buffer due to retransmission by other sources and results in 
increased blocking of the tagged packet. 

In summary, probabilistic models have been constructed to study the 
effect of packet size and message size on system throughput and response 
time of a data-transfer operation. The effect of packet retransmission on 
response times has also been evaluated. 
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DEFENSE DATA NETWORK (DDN) PERFORMANCE ANALYSIS USING 

PROBABILITY MODELING 

D. P. Gaver 
P. A. Jacobs 



1. INTRODUCTION 

The DDN is a large packet switching network that services elements of 
the U.S. DoD. It currently has on the order of 260 packet switching nodes 
(PSNs) that provide entry to the network. Each PSN is accessed from a 
number of local hosts, into which traffic flows on the way into, and out of, the 
DDN itself. The actual users are connected to the hosts via their terminals. A 
schematic of the relationships appears as Figure 1. 

Although the general levels of traffic, in terms of demands from users for 
capacity to transfer bits from their terminals and back, are known generally, 
this demand varies randomly in time to a considerable degree. It is 
apparently not practical to insist that demands for, say, transfer of a large data 
base ffom a user at terminal S to another at terminal D be done on a 
reservation basis. Consequently demands occur unpredictably, enter the 
network as packets, and the packets are routed from PSN to PSN according to 
an expeditious logic. If too many packets are admitted into the network 
simultaneously then a detrimental level of congestion occurs at the nodes: 
buffers that store the incoming datagrams become filled, and the following 
datagrams, encountering full buffers, are lost and must ultimately be 
retransmitted. Consequently the transfer of the associated customer's data 
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base is slowed. Congestion control measures are required to prevent this 
from happening, and to compensate for tendencies in the direction of excess 
congestion, where that congestion comes about in part from the bunchy, 
random nature of demands for service, i.e., data transfer. 

The emphasis of this report is to perform probabilistic analysis of certain 
features of the DDN system with a view to enhancing or "optimizing" 
measures of service such as data base throughput and the waiting times 
experienced by data-base-transfer customers. The particular questions 
addressed, and the models constructed, are in response to tasking statements 
supplied by personnel from the Defense Communications, Engineering 
Center, Reston, Virginia. 

2. PACKET SIZING 

When a data transfer is to occur in a network like DDN the total 
collection of bits that comprise the data base (db) is divided into packets, i.e., 
subcollections of contiguous bits from the db plus a header consisting of the 
bits carrying address information. Packets from a Source terminal are sent 
into the network via a host's PSN, and proceed along individual trajectories, 
i.e., via intermediate PSN's, to the PSN attached ultimately to the 
Destination's terminal. The trajectories or paths through the network are 
dictated by a routing algorithm with the general aim of minimizing transit 
time; individual packets may well follow different paths. The number of 
PSN's encountered by a packet during its transit from S to D is called the 
number of hops. When a packet reaches its destination it is typically 
acknowledged; information is returned to S to indicate that the particular 
packet has arrived. 
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2.1 Packet Size and Error Rate: Model 1 

As an individual packet proceeds from S to D it utilizes trunks and passes 
through buffers and is processed by switches at the PSNs hopped through. 
Each bit in the packet is susceptible to corruption by various sources along the 
way. If any bit in a packet is so corrupted the entire packet is viewed as useless 
and must be retransmitted. If bit corruption is viewed as a process that either 
does, or does not, occur independently from bit to bit then it is clear that a 
large packet is more likely to suffer contamination than is a small one. On 
the other hand each packet of no matter what size has about the same 
number of header bits, so small packets require the transfer of an 
uneconomically large amount of header information as compared to the 
“live" bits that are the data to be transferred. The question is: how to choose 
the number, b, of live bits per packet when a) the success probability per bit is 
p, and b) the number of header bits per packet is fixed and equal to h? 

Note that the above only recognizes one issue that may govern the 
number of live bits per packet. It focuses on error corruption only; if p is 
high, as will often be true, the optimal packet size will be large if only errors 
are important, as may be true when the network is very lightly loaded. Under 
heavy load conditions a large packet will prove to be non-optimal since large 
packets tend to be awkward for the PSN's buffers to handle. The latter issue 
will be addressed later in this report. 
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Model 1. Mathematical Formulation 

A simple analysis of Model 1 is possible. Let 

p = probability that a bit is successfully transmitted error-free from S to 
D. Successive bit conditions are independent; i.e., errors are like 
biased coin flips (are modeled as Bernoulli trials). 

h = number of bits in a header (and tailer); assumed constant. 

b = number of "live" bits, a decision variable. 

Let N(b) be the (random) number of bits that must be transmitted until a full 
packet of b live and h header bits arrives intact after one hop. As soon as such 
a packet arrives at D at the end of the hop another is instantly sent 
(acknowledgements are assumed to be instantaneous, or else the number of 
bits is incorporated in h). Clearly N(b) is the inter-event time in a discrete 
time renewal process; see Ross (1989). The sender receives a reward of b bits 
whenever N(b) terminates, meaning that at independently identically 
distributed random intervals (N(b)j, j = 1, 2, ...) a packet containing b live, 
uncontaminated bits arrives. Of course if p is realistically large there will be 
long strings of good all-live-bit packets, rarely punctuated by very short strings 

I* 

of contaminated packets. 

From the following conditional probability argument we can derive what 
is needed. Note that 

b+h with probability p*’+^ = 



N(b) = i 



b+h+N»(b) 



with probability i-pb+h=i_€"^(’’+^) 



( 2 . 1 ) 



where X = -In p. 

The first line of (2.1) says that a (new) reward of b bits is collected after one 
b+ h -bit packet arrives if all bits are uncontaminated. The second line says 
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that a reward of b is collected after the first (contaminated) packet is received 
plus the number of additional packets until an uncontaminated b+h -bit 
packet arrives; N*(b) has the same distribution as does N(b) unconditionally: 
after the passage of b+h contaminated packets the process "starts over." 

Now take expectations through (2.1): 



E[N(b)] = (b+h)pb+h+{(b+h)+E[N#(b)]} (l-p^+h) = (b+h)+E[N(b)](l-pb+h). (2.2) 



Now the long-run rate of accumulation of good=live (uncontaminated) 
packets per packet sent is 



To find the optimal b differentiate r(b) with respect to b and set the derivative 
equal to zero (despite the fact that b is discrete the result is accurate) 



so 



E[N(b)] =^b+h = (b+h)e^(*^+h)_ 



(2.3) 



E[N(b)] (b+h)e^^'^‘*^^^ 



(2.4) 



dr(b) [(b+h)e^(^+^)]-b[e^^+^^+ X(b+h) 



db 



((b+h)e^(b+h))2 



Cancel > 0 and simplify to obtain the simple quadratic: 

b^+hb-h/X = 0, 



the appropriate solution of which is 




(2.5) 



If p = 1 then X is small and positive, which sends bopt towards large values: 



ss X- — ^ 0. 



bopt ~'\lx 



( 2 . 6 ) 



Model 2. 

To generalize the previous model to packets traveling J hops, let Ai(b) be 
the number of additional bits needed to transmit a packet of length b over the 
it^’ hop, i = 1, ..., J. We assume that successful transmission of bits over each 
hop are independent events 



with probability p'’^^ 



Ai(b) = 



_ < 



(2.7) 



b+h + Aj (b) with probability 1-p^+h 



where A| (b) is the number of additional bits needed to transmit a packet of 



length b over the i^h hop after the first failed transmission. Letting X =-lnp as 
before and taking expectations results in 

E[Ai(b)] = (b+h) (i-e->-(b+h))e>^(b+h) = (b+h) [e>-(b+h)_i]. (2.g) 



t* 

Let Nj(b) be the number of packets needed to transmit a packet of length b 
over J hops. Then, 

J 

Nj(b) = b+h+XAi(b). (2.9) 

i=l 



Taking expectations results in 

E[Nj(b)] = b+h+J(b+h) [e^(b+h)_i], ( 2 . 10 ) 

The long run average rate at which active bits are accumulated is 
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E[NjCb)] (b+h)[l+J(e^^^+^^-l)] 
Taking derivatives with respect to b 

^ _ hfl-T(e>-(b+h)_i)i_b(b+hUTe^b^h) 

db ■ (b+h)2[l+J(e^b+h)_i)]2 

d 

Setting ^ r(b) = 0 results in the equation 

0 = h[l+J(e^(b+h)_i)]_b(b+h)Xje^(b+h). 



( 2 . 11 ) 



( 2 . 12 ) 



(2.13) 



This equation can be solved numerically to find the maximizing b. 

If X.«l; that is p=l; then equation (2.13) can be approximated by 

0 =h[l+Xj(b+h)]-b(b+h)Xj[l+)i(b+h)] (2.14) 

= h+Xjbh+XJh2-b2XJ-bhXj 
= h(l+)ijh)-b2?ij. 

The maximizing b is 

^ /h(l+X,Jh) ^ / h 
bopt = "ysj ~ V M' 

As themumber of hops increases the maximizing packet size decreases. Note 
that the above formula is easily generalized to show what happens if X 
becomes Xi = -In pi, so that the probability of successful transmission depends 
upon the exact (i‘^) hop under discussion. 

Figures 2 and 3 display the right hand side (RHS) of equation (2.14) for 
various parameter values. In Figure 2, h = 20 bits and J = 4 with p = 0.99999, 
0.99995. The approximate solution (2.15) gives yields bopt = 707 for p = 0.99999 
and bopt ~ 316 for p = 0.99995. As expected, a larger probability of successful 
transmission indicates a larger best packet size. In Figure 3, p = 0.99999, h = 20 
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and the number of hops is allowed to vary. The approximate formula (2.15) 
in these cases gives bopt = 1414 for J = 1, bopt = 707 for J = 4, and bopt = 500 for 
J = 8. Not surprising, the best packet size decreases as the number of hops 
increases. 

Comment. The above analysis tacitly assumes that success probability, p, or pi 
on link i, is known; it is also assumed that errors obey Bernoulli-trials laws. 
Neither is necessarily correct. An adaptive procedure might well be 
developed to react to changes in p or pi, particularly to indications of 
considerable decrease, evidence for which could come from demands for 
retransmit that are not related to losses resulting from encounters with full 
buffers. A rule that automatically contracts b when an error occurs that 
compels retransmit could be used. Likewise, a rule that expands b when no 
errors occur could be devised and evaluated. These problems could be taken 
up in future work. 



»♦ 
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3. MODELING A BUFFER THAT EXPERIENCES LOSSES AND RE- 
TRANSMISSIONS 

3.1 Problem Formulation 

We consider an aspect of the problem of data transfer in a packet 
switching store-and-forward network similar to the Defense Data Network 
(DDN). A source node, S, at which the data resides, wishes to transfer it to 
destination node D. It must do so via various intermediate nodes. The data 
at S is packetized, and individual packets are sent forth; if a packet is not 
acknowledged by a certain time, 5, the packet is re-transmitted, and so on 
until acknowledgement is received. The reason for this tactic is that packets 
may be lost in transit; loss may be caused by a packet's encountering a full 
buffer at some intermediate node. 

It is intuitively clear, and has been recognized widely, that an overly-short 
re-transmit time can clog the system with superfluous packets, bringing 
about drastic slowdowns; see Jacobson (1988), Zhang (1986) and Gerla and 
Kleinrock (1980). On the other hand, overly-lengthy re-transmit intervals can 

I* 

induce unnecessarily long or prolonged delays. It is of interest to formulate 
models that suggest a reasonable compromise position, and this is done here. 
The approach is admittedly approximate, but may form the basis for a better 
understanding of the situation, and for more realistic strategies. Related 
issues have been studied for other types of networks; cf. Heyman (1982, 1986) 
and Fredericks and Riesner (1978). 

Our analysis proceeds by first making a very simple fluid model for the 
contents of the buffer at D. Essentially we are characterizing the environment 



encountered by a packet, P, or tagged packet, i.e., a representative packet being 
sent from S to D as part of the data transfer of interest. Note that we assume 
that the other packets that contend with P for buffer space also re-transmit 
and occupy the buffer at D. It is their fate to influence, and be influenced by, 
the magnitude of 5. Thus if 5 is large, we expect relatively few superfluous 
packets at buffer D, and relative ease of entry for P provided she is not lost in 
transit. Decreasing 5 tends initially to speed the arrival of a P-copy to D, but if 
such decrease is universal, applying to all users of the buffer, it also implies 
that D accumulates more copies of other packets in storage, tending to 
overload D, leading to rejections of P-copies until one luckily squeezes in. 
Very likely the time for ultimate squeeze-in will be long if many are in 
competition. We seek for a reasonable compromise between the too-long 8 
that delivers excessive delays for lost packets and the too-short 5 that leads to 
an atmosphere of "global warming," losses, and possible system crash. We 
also model the behavior of exponential backoff, a device that increases 
successive time-out intervals, possibly from 5 to 25, 25 to 45, etc. See Jacobson 
(1988) and Aldous (1987). 

3.2 A simple Fluid Model 
(Model 1) 

Let w(t) represent the work present in D's buffer at time t after the process 
start; let B be the size of that buffer. By work we mean the number of time 
units required to serve, process, or forward those packets in the buffer at t; B is 
measured in work units. View w(t) as the expectation of W(t), a random 
process that would be an unwieldy generalization of the Takacs virtual work 
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process for the M/G/1 queue; see Takacs (1962). In Gaver (1964) a related 
Takacs model is considered. 



We propose this initial model: that if dt>0, and 0<w(t)<B, 



w\ 



'(f + dt) = w(t) + j^Adf|l - (w(f) / B)^j s[l + tw(f )] 



kw(t)dt 
1 + kw{t) 



or, letting dt^O, 




2 



with k = -. 
s 

Here we think of Xdt as being the probability that a new or original packet 

arrives at the designated buffer, the term l-(w(t)/B)P being then the 

probability that it enters D's buffer; we take p > 1; p can be adjusted to make 

the probability of loss resemble what is measured. The amount of work the 

arriving packet brings to D consists of two parts. The first part is the packet's 

service time s. The second part consists of the amount of work that will 

arrive due to retransmissions of the arriving packet; there will be gw(t)Huw(t) 
»♦ 

retransmits in the time a new arrival at time t must wait for service, w(t); 
each retransmission brings in additional work of duration s. We approximate 
the effect of the total work associated with an arriving packet by adding 
together the work associated with the original arrival, s, and the total work 
associated with all retransmissions t)w(t)s. Of course work is removed from 
the buffer at constant, unit, rate when w(t) > 0, and at zero rate when w(t) = 0. 
It is convenient to model this discontinuous effect by kw(t)/(l+kw(t)), a 
device apparently originally suggested by C. Agnew (1976); see also Filipiak 



(1988); the constant k is chosen so that the limiting solution to (3.1) as t — > <» 
when B = «o and u = 0 is the average work in queue for the M/G/1 queue. 

The resulting differential equation, (3.1), for w(t) is non-linear and of first 
order, and can be solved numerically. Certain qualitative features emerge by 
examining the right side; put p=Xs and assume p<l; viewed as a function of w 
we have 




(3.2) 



For \) small, then R(w;u) has a unique zero at 0 < W] < B. 



R(w;u) 



P 




w 



Figure 4 



If u » 1 (large), then a larger unique zero, W 2 , exists. 
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R(w;\)) 




At intermediate values of \) multiple (three) zeros can occur: 
R(w;u) 




These zeros inform us as to the time-dependent behavior of w(t): 

a) If u = 0, so re-transmits occur very rarely, then w(t) resembles the 
mean behavior of an ordinary single-server queue with finite buffer. 
If p < 1 (somewhere around 0.5 may be reasonable in practice), then 
w(t) will tend to be small, i.e., at level wi. 

b) If u » 1, so re-transmits occur quite rapidly, then the buffer tends to 
be quite full, i.e., at level W 2 , with W 2 close to B. 

c) If x> takes on a range of intermediate values bistability may occur: 
the work in the buffer resides for a time at a low value, W 3 i in the 



long run, then switches to a much higher value, W32; after a time 
there is a switch back to (near) W31, then back to W32, etc. 

The values W31 and W32 in Figure 6 are local stable points because if 

w(t) = w < W31 then R(w;\>) = dw/dt is positive, which means that 

w(t+dt) > w(t), sending w(t+dt) up towards W31; if w(t) = w>W3i (slightly 

above) then R(w;u) = dw/dt is negative, sending w(t+dt) down towards W31. 

Consequently w(t) tends to remain near the stable equilibrium W31. If chance 

events send the buffer contents sharply higher than W31 then the same 

behavior will occur at w-values near W32, another locally stable equilibrium. 

The remaining zero, between W31 and W32, is seen to be an unstable 

equilibrium point: any value of w just to its left is associated with a negative 

derivative, sending system state towards W31; any value just to its right 

pushes that state toward W32. Bistability has also been noted in models of 

circuit switched communication networks; Gibbens, et. al. (1988). 



3.3 "The" Long-run Solution 

Since for fixed parameters \), B and p, w(t) is bounded and displays no 
absorbing states, it is plausible that there be a long-run solution w = lim w(t) 

it t—>oo 



that satisfies the equation obtained by letting dw/dt=0 in (2.1); i.e.. 



1 - 



(vw + 1 ) = 0. 

1 + kw 



(3.3) 



It is natural to take p<l, perhaps p=0.6 or less, so that the D- buffer is 
normally rather lightly loaded. 

Let 
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/(ui) = 



(3.4) 



and 



with k = 



1- 



U)\P 



kB^ 



\ 

(iw + l) 

/ 




(3.5) 



Note that f(0) = 1, f(B) = 0, g(0) = 0, g(B) = — ^ function of w, g is 

strictly increasing. For fixed service time s, as a function of X, g increases as X 
decreases. 

Differentiating, 



dw‘ 






TV 



(3.6) 



For p = 1, f is a quadratic function and 



div 



f{w) = 



1 ^U>V 

B B 



2ivv ( 1 

+ i» = + V 

B [ B 



(3.7) 



Hence if u < g, then f is decreasing on [0,B] and there will be 1 zero for the 



equation f - g; the zero will be close to zero. As u increases, f is an increasing 
function of x>. For large u, f - g will also have 1 zero on [0,B], but the zero will 
be close to B. For intermediate values of u, f - g will have 3 zeros on [0,B]. 
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Unfortunately there is no clean crisp way of characterizing the shift from one 
zero to three, so we rely on numerical calculations. 

For p > 1, 



dw 



/(w)| = u 

ze = 0 



and f(w) decreases as w increases on [0,B] for fixed u and B. Hence, f is a 

concave function with a unique maximum on [0,B]. The equation f - g will 
have at least 1 zero in [0,B]. For small x> and large t), f-g will have 1 zero. 
For intermediate u, f - g can have 3 zeros. Figure 7 shows graphs of f and g 
for various values of X and x> with B = 10, s = 1, and p = 20. 

The solutions of (3.3) are precisely the long-run values of w(t) discussed 
earlier: in the long run the work in the buffer is located near wi or W2 when 
the re-transmit rate x> is either small or large, but hops between W31 and W32 
when X) is of intermediate magnitude, these hops being caused by random 
events that are not modeled here. A more detailed stochastic analysis is 
required to fully explicate the situation. But presumably as x> increases, the 
stationary density of W, a random variable representing long-run buffer 
contents, will concentrate at a low value near wi; its density function will 
gradually exhibit a vestigial mode near W32 which grows in size if u increases. 
Finally, if u becomes much larger, the vestigial mode at W31 will vanish to be 
replaced with a major mode at W2 = B. It is perhaps even more important to 
note that such hopping between modes, new preference for the higher mode 
(essentially full buffer) can equally well be in response to increase in primary 
traffic rate X with 5 kept fixed. 
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3.4 On Mean Packet Response Time 



Now consider a tagged packet, P, in the environment modeled. Let a be 
the probability that the tagged packet is lost (encounters a full buffer) on its 
way to D. Let x(\)) = E[Tw(u)], where Tw(u) denotes the response time of the 
tagged packet in the environment of u-rate re-transmits. 

Then, by a renewal-type backward argument. 



r(i>) = a 



— + wi + t ( u ) 



a{w{v) / Bf 






a|l - (w{v) / fif j(wD + w( 

(3.8) 



where wi is the delay experienced while P percolates through nodes 
intermediate between S and D, conditional on its being lost before reaching D; 

P 

WD is the time to percolate through to the destination node, and (w(u)/B) is 
the probability that it is finally lost at D. In the first two aforementioned cases 
P's response time starts over again after loss, while if the packet finally 
reaches D, after successfully passing through the intermediate nodes, it 
experiences the expected delay present at D, namely w(u) in addition to the 
total percolation time wd- 

Example. Figure 8 displays the expected response time as a function of the 
retransmission rate u for a simple case of the previous model. For the 
examples presented here wi = wd = 0 and p = 1 • The amount of work in the 
system when the tagged packet arrives at the node, w(o), is taken to be the 
steady state solution satisfying the equation (3.1). If equation (3.1) has 
multiple solutions, the largest solution is picked as a conservative choice. 
Under these assumptions (4.1) reduces to 




Solving results in 




(3.9) 



Two curves are presented in Figure 8, one for a = 0.0 and the other for a = 0.1. 
The other parameters are X = 0.5, s = 1, and B = 10. In both cases, there is a 
retransmission rate that minimizes expected response time x(\)). The 
minimizing retransmission rate is larger for a = 0.1 than for a = 0.0 as is to be 
expected. 

3.5 Fluid Model II: Retries 

In order to achieve ultimate simplicity the former fluid model omitted 
explicit consideration of primary retries: those original or new packets that 
are lost immediately because they encounter a full destination buffer. In 
order to represent their effect it is necessary to introduce a new state variable, 
r(t), which represents the number of different packets in the retry state at time 
t; each of these gives rise to repeated work requests until entry is achieved. 

The evolution of r(t) is described by the differential equation 



The first term on the RHS represents those original packets that encounter a 
full buffer and hence enter the retry population. The second RHS term 
represents the rate of retries from that population that enter the buffer. The 




(3.10) 
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third term, -0r(t), is a control that depletes the retry population as it grows; 
this corresponds to a timeout on packets that are unable to make the original 
entry. 

The corresponding equation for the actual buffer contents, w(t), is now 



the addition of \)r(t) to X in the first term of the RHS represents the arrivals 
from the retry population. 

Note that it has unfortunately been necessary to expand the state space to 
incorporate the effect of retries, which will quite likely be rare if designs are 
conservative. 

The long-run or steady-state situation is modeled by putting 



Now we can re-define the functions f and g previously introduced and use 
them to compute the solutions of (3.12), i.e., the stable points: 



dw{t) 

dt 




dr(t) dw(t) 



= 0; one can then solve for r=r(<») from (3.10) and substitute into 



dt " dt 



(3.11) to obtain a single equation for w=w(°o): 





(3.13) 



and 



1 r {2/s)iv ^ 
As 1^1 + (2/ s)w j 



(3.14) 
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Notice that f(0;v>) = 1, f(B;o) = 0 for all 0>O, a desirable behavior. Also, since 
the effect of original packet arrivals is entirely concentrated in g(w) it is clear 
from examination of Figures 9 and 10 that increase in either X (arrival rate) or 
s (packet size) will drive g(w) lower for each fixed w, thus increasing the 
propensity for the higher stable point, that near B, to make its appearance. 
The parameters of Figure 9 are s = 1, B = 10, p = 20, u = 0.3, 0 = 0.05 and X = 1, 
0.5, 0.3. The parameters of Figure 10 are traffic intensity p = X/s = 1/3, B = 10, 
p = 20, u = 0.3, 0 = 0.05 and service times s = 0.1, 1, and 1.5. Of course this is the 
condition that suggests that retransmission rate u is too high, and could be 
reduced to accommodate new conditions. Such a general procedure has been 
suggested in the past and is termed exponential backoff. Exponential backoff 
has been theoretically treated by Aldous (1987) and others for another type of 
network. It has also been incorporated into an engineering treatment by 
Jacobson (1988) and others. The next model incorporates exponential backoff 
into the previous model. Again, the attempt is to parsimoniously trace the 
impact of a rule followed universally by all users of the facility (buffer) under 
examination. It may be cogently argued that such a universal rule is not fair, 
since it treats all users similarly, regardless of whether they are "large" or 
"small." If user priorities are equal then it may seem reasonable to 
accommodate more quickly to the small users than to the large: a lost or 
rejected (from the buffer) small user is allowed to retransmit sooner than is a 
large. On the other hand, if many small users are given this privilege then a 
few large users may be undesirably delayed. We do not attempt to resolve 
this issue in what follows. 
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3.6 Fluid Model III: Exponential Backoff 

Exponential backoff allows the retransmit interval to become dynamically 
adaptive to evidence of increased congestion. 

Let n(w) be the largest value of n such that 

6[l+p+p2+... +pn] < w (3.15) 

where 5 is the minimum time between retransmissions and P>1 is the 
backoff factor; n(w) is the number of retransmissions to occur in a time w. An 
integral approximation to the LHS of (3.15) is 

= 5j^^"^Kxp{x\r\p}dx = :j^[exp{n(u;)ln^} - 1]. (3.16) 



Thus, an approximation to n(w) can be found by solving the equation 

5 



w = 






[exp{n(u;) In/j}- 1] 



(3.17) 



Hence, 



n{w) ~ c(w) = — + lovlnji] 

Inp 



(3.18) 



1 



where u = ^ is the original retransmission rate. 

A fluid model for the work in D's buffer at time t, w(t), with backoff and 
retries satisfies the system of equations 

\P 



m = 4 - w(o 

dt [ B ) ^ ^ 



'-(fl 



- er(t) 



(3.19) 



dw{t) 

dt 





{2/s)w{t) 
l + {2 / s)w{t) 



(3.20) 
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where c(w) is defined in (3.18). 



As before, the long run or steady-state situation is modeled by putting 
dr(t) dw(t) 

- = 0; one can then solve for r = r(w) and substitute into (3.20) to 
obtain a single equation for w=w(oo): 



0 = As 



-(ty/B)Pj[l + c(w)] 



u + B 



v\l 






+ d 



(2 / s)w 
1 + (2 / s)zv 



( 3 . 21 ) 



Let 



f{w;v) 




{w / Bf^l + c{w)] 




v + d 

{w/Bf^ + e 



( 3 . 22 ) 



and g(w) be as in (3.14). Notice that f(0;u) = 1 and f(B;u) = 0 for all 6>0. The 
effect of exponential backoff is contained in f. Examination of Figure 11 
indicates that increase in the backoff factor P will drive f(w;u) lower, thus 
decreasing the propensity for the higher stable point. The parameters of 
Figure 11 are A = 1/3, s = 1, p = 20, u = 0.3, 6 = 0.05 and backoff parameter 
P = 1.01, 1.5 and 3. Hence exponential backoff will provide some protection 
against a system crash. 

Consider a tagged packet P in the environment with retries and backoff. 
We will model P's expected response time as follows. Let mj = E[Tj] where Tj 
is the additional response time needed after i unsuccessful retransmissions. 
Using the notation of section 3.4, the following recursive relationship can be 
written 
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(3.23) 



m,- = + wi + W /+1 j 

+a(w{v,p) / Bf^p'd + Wq + m,+i] 

+a^l-(zv{v,p)/Bf^W[)+w{v,p)] 

where w(t>,P) denotes the expected amount of work present in D's buffer. 
Setting Y = (w(u,P)/B)*’ and 

d = ocwi + ayiO£) + ay[wo + w{v,p)] 

= cavi +awQ + ayiv{v,p), ( 3 - 24 ) 

substitution yields 

Wg = d+ 5[a + ay\ + [1 - a y]?n| 

= d+S[a + ay] + [l - a f][d + p5[a + ay] + [l - a y]m 2 ] 



= d l + [l-a y] + [l- a y]^+"-+[l- a y]”j 
+5[a + ay]|l + [l - ay]P +- • • +[1 - a yj" ” | 

Thus, if p[l - ay] < 1, then the expected response time 

d 5\a + ay] 
mo= — + - jlL . 

ay l-p[l-ayj 

If/S[l -ay] > 1, then the expected response time is infinite. 
Putting wi = WD = 0, it follows that d = ayco{v,P) and 

5[« + ay] 



niQ =w{v,p) + 



l-j3[l-ay]' 



(3.25) 



(3.26) 



(3.27) 
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3.7 A Diffusion Model 

In this section we replace the simple deterministic fluid model of Section 
3.2 with a simple stochastic diffusion model. Let W(t) represent the random 
amount of work present in D's buffer at time t. We will model (W(t); t > 0} 
with a diffusion process satisfying the following stochastic differential 



equation 




dW(t) = a(W(t)) + Vb(W(t)) dU(t) 


(3.28) 


where 




a(w) = A.(l-(w/B)P) [l+uw]s-l; 


(3.29) 




b(w) = X(l-(w/B)P) [l+\)w]^s^; 


(3.30) 



and {U(t); t > 0} is a standard Brownian motion. The infinitesimal mean of 
W(t), a(w), consists of two parts. When there are w units of work in the 
buffer a new packet arrives and enters the buffer at a rate X(l-(w/B)P); the 
packet brings a total amount of work to the buffer equal to his service time s 
plus the additional work due to retransmissions which we take to be uws as 
before.' Finally, the server works at a unit rate to clear work from the buffer. 
The infinitesimal variance of W(t), b(w), corresponds to the variance term for 
a compound Poisson process with arrival rate X,(l-(w/B)P) and second 
moment of the magnitude of arriving work which we take to be (l+uw)^s^. 

Figures 12-15 present sample functions of the following simulation of a 
discrete time approximation to {W(t); t > 0). In the following A = 0.1 is the 
time increment and {Uni are independent identically distributed standard 
normal random variables. The discrete time process evolves as 
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W*((n+1)A) = W(nA) + a(W(nA))A + Vb(W(nA))A Un+i 

where a(w) and b(w) are given by equations (3.24M3.25). Finally, the amount 
of work in the buffer at time (n+l)A, W((n+1)A), is W*((n+1)A) truncated to 
be within the interval [0,B]; that is, 

W((n+1)A) = max(min(W*((n+l)A),B),0). 

In Figures 12-15, the arrival rate of new packets is X = 0.5, the service time 

s = 1, the buffer size B = 10 and p = 20. The initial work in the buffer is 

W(0) = 1. In Figure 12, \) = 0.0 and {W(t); t > 0) remains less than 2 most of the 

time with occasional movement to values larger than 3 but less than 4. In 

Figure 13, i) = 0.1 and the behavior of {W(t)} is similar to that in the case when 

u = 0.0. However, the values of {W(t)} are somewhat larger. In Figure 14 

x> = 0.5 and {W(t); t > 0} is oscillating between small values and large values 

close to the buffer size of 10. This sample path behavior is called bistability. 

The behavior was suggested by the multiple zeros of the steady state fluid 

model equation (3.3). In Figure 15, v> - 0.8 and (W(t); t > 0} is also oscillating 

between large and small values with more of its time being spent at the full 
»* 

buffer level 10. If the time interval A is decreased, similar behavior results 
but it is less pronounced. 

A simulation was done of the work in the system after a packet arrival for 
Poisson arrivals of the packets. In particular, let Sp be the time of the n‘^ 
arrival and Wp be the amount of work in the system just after the arrival of 
the n‘*' customer. Let 

Wn+i = max((Wp-(Sp+i-Sp)},0) 
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be the amount of work in the system just before the (n+l)®* arrival. The 
amount of work in the system just after the (n+1) arrival is 



Wn+i = min({Wn+i + \)(Wn+i + l)s}, B). 



In the simulations Wq = 1. Figures 16-18 present sample paths of {Wnl for 
X = 0.5, s = 1, B = 10 and various values of \). In Figure 16, v = 0.3 and the 
amount of work in the system is small. Figure 17 with v = 0.5 shows some 
oscillation between small and large values of work. Finally, Figure 18 with 
x> = 0.8 shows a great deal of oscillation with more time being spent with a full 
buffer. Thus the diffusion approximation is representing the bistable 
behavior of the system with Poisson arrivals. 

3.8 The Density Function for the Steady State Amoimt of Work in the Buffer 
We assume the conditional density function of W(t), given W(0) = wq, 
satisfies the Fokker-Planck equation 




(3.31) 



where We have suppressed the dependence on wq. 
Integrate once on w, to obtain 




(3.32) 



Assume a limiting density function exists and as t — > « lim ^ F(t,w) = 0. 



Set f(w) = lim f(t,w). The density function f satisfies the equation 



•oo 



1 d 

0 = -a(w)f(w) + 2 ^ (b(w)f(w)). 



(3.33) 
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Solving (3.33) results in 






Hy) 



.0 



m 



w 






(l+uy)s h{y) 



m 



(3.34) 






w 




■ 


-2j 

L 0 




dy 



where C is a normalizing constant. 

Example. Figure 19 shows the density function (3.34) for the case p = 1 , A, = 0.5, 
s = 1, B = 10 and two values of \) = 0.1 and 0.8. The integral of (3.34) is 
evaluated using partial fractions. For u = 0.1, the mass of the density function 
is concentrated at small values of w. For u = 0.8, the mass of the density 
function is bathtub shaped with mass concentrated both at small values of w 
near zero and at large values of w near the buffer limit of 10. This 
corresponds to the fluid approximation with two stable points, W 31 and W 32 . 



4. CONCLUSIONS 

In response to the original tasking on this project, mathematical analysis 
has been conducted that exhibits the influence of packet size on system 
throughput and waiting time for elements of a data-transfer operation. 
Congestion control protocols are evaluated that tend to minimize the 
unwanted effects of retransmits on overall system performance, as measured 
by response times to request for data transfer. 
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Figure 1 
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Figure 10 
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Figure 11 



SIMULATION OF DIFFUSION;L=.5;S= 1 ;P=20;NU=0.0 




36 



Figure 12 
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Figure 13 
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Figure 19 
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APPENDIX A. PROBABILISTIC EVALUATION OF A PACKET SWITCHING 

PROTOCOL 

D. P. Gaver 
P. A. Jacobs 
P. Purdue 



Abstract 

A simple initial model is introduced to study the effect of the number of 
packets in a message upon data transfer time in the Defense Communications 
Agency's Defense Data Network (DDN). 

Key Words: Exponential packet transit time; number of packets per message; 
receiver blocking; infinite server queue; Erlang loss formula. 



1. INTRODUCTION 

The purpose of this report is to introduce an initial model for the effect of 
message size upon data transfer time in the Defense Communications 
Agency's Defense Data Network (DDN). The model is quite simple, but does 
contain certain qualitative features of the DDN, as we currently understand 
them. It is hoped that by explicitly formulating and exploring such a model 
we can elicit comments that will lead to improvement, and eventually result 
in a useful tool for some aspects of system planning. 
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Our modeling approach is to focus on the experience of a single tagged 
job, ], that is to be accomplished in an environment of many other jobs all 
utilizing, and competing for, a number of resources; packet switches and the 
links interconnecting them. We think of a job as meaning the transfer of a 
certain number of packets (of data) from one node to another in packet 
switching fashion. We assume to start with that the aspects of behavior of the 
particular tagged job, J, that are environmentally influenced are not changed 
by changes made in the way J's transmission is managed: in particular, 

attempts to transmit J in one piece, or in minimal packet-sized, "messages," 
or at some intermediate degree of message size (m packets) does not influence 
the environment. At a later stage of analysis we will attempt to represent the 
"environmental impact" of a policy that sends all (or many) messages of 
about the same size; perhaps this can be done by a process of iteration. Our 
initial approach seems necessitated by the complexity of the true packet 
switching setup, which involves buffers (queues) at the various nodes 
(switches) having limited capacities and elaborate protocols for traffic 
management. We start in as simple a way as seems qualitatively correct, and 

I* 

then plan to ratchet the model up to deal with whatever issues come to light. 

With this preamble, here is our initial message transfer protocol and 
model formulation. 

At an initial instant (t=0) a group of m>l packets is to be sent from a 
Source buffer, S, to a destination buffer, D, with the following protocol in 
effect: 
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Protocol. 



a) The m packets are sent out simultaneously via the network, 
destined for D. The first packet to arrive and find space for the m- 
packet message occupies a one-packet space and reserves space for 
m— 1 others, and sends an acknowledgement back to S. 

b) If no acknowledgement is received by S in time 6 each packet is re- 
transmitted; this action is repeated until acknowledgement occurs. 
Such is necessary because packets may encounter full buffers in 
transit and be lost, or may reach D, encounter a too-full buffer and be 
lost, or be lost for other reasons. 

c) Once the initial reservation is made a time must elapse until the 
remaining m-1 packets (originals, or duplicates thereof) reach D: 
each subsequent packet experiences delays similar to the first, but 
need not reserve space in D's buffer. 

Next we construct a mathematical model that reflects the effect of the 
above protocol and behavior. 

2. PROBABILITY MODEL FOR MESSAGE SERVICE TIME: TIME FOR THE 

FIRST ENTRY TO RESERVE BUFFER. 

Packets transmitted from S to D on a network may travel by different 
routes ^and arrive somewhat independently, at which moment there is an 
attempt to capture space in a finite buffer owned by D. The fact that a 
particular message’s packet must compete with others suggests these 
convenient assumptions: 

Ai: transit times T of individual packets from the tagged message are 
independently and identically distributed according to F(t); the form of this 
latter distribution will be discussed later; 

A 2 : when a packet of the tagged message arrives at D's buffer then if it is 
the first and is to be successful it must capture a space of size m — space for m- 
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1 packets besides itself. We assume that the probability of such capture, 
denoted by a=a(m), may be treated as a success probability in independent 
(Bernoulli) trials. This is suggested if each packet of the tagged message 
arrives independently and in the general company of others from many other 
sources arriving at, and demanding space in, D's buffer. Actual 
determination of the probability a(m) will be discussed later. 

On the basis of the Protocol and assumptions Aj and A 2 it is now possible 
to write down an expression for the probability that T(l), the time of first 
packet entry to D's buffer, exceeds t: 



P {T(l)>t} = 



lt/5) 

n 



i=0 



xm 

(aF(t-i5)+a) 



It/6] 



n (l-aF(t-i5)) 



i=0 



( 2 . 1 ) 



As usual [t/5] is the largest integer < t/6. 

This simply expresses the fact that no packet — either an original or a 
»♦ 

copy — has entered D's buffer by time t. Given F and a, and also 5 and m, one 
can evaluate the above probability, but in general this is a tedious numerical 
task. 



3. APPROXIMATION TO THE DISTRIBUTION OF T(l), THE 
RESERVATION TIME 

Note that taking logarithms of both sides of (2.1) gives 



4 
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lnP{T(l)>t} =m 



|t/5| ^ 

X ln(l-aF(t-i5)) 

i=0 



m 

T 



It/51 

X ln(l-aF(t-i6)) 

i=0 



(3.1,a) 



~ ^ J In (l-aF(x)) dx 



(3.1,b) 



where we have viewed the sum as a Riemann sum approximating the 
integral. Use of the Euler-MacLaurier sum formula may produce useful 
improvements. 

To get further explicit results assume that F(x) ~ (Xx)^; X, p>0 as X 0. 
This is a property of the Weibull and Gamma distributions, both reasonable 
candidates for describing transit times. Putting this into (3.1,b) yields after 
further Taylor expansion 

t 

lnP{T(l)>t) ~ j (Xx)P dx = XP 




so 



P{T(l)>t) ~ exp 



maXP tP^’ 

5 *p+l 



(3.2) 



and the distribution of T(l) is seen to be approximately Weibull. If P=l, the 
transit times T are exponentially distributed and the distribution of T(l) is 
approximately Rayleigh. In this case. 
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(3.3) 



E[T( 1 )] = j P{T(l)>t)dt ~ j exp{-^x2|dx = 

0 0 

as explicit a formula as is likely to be available. General Weibull (p>0 
arbitrary) results can be found in terms of gamma functions. Note that the 
appearance of the square root in (3.3) suggests that precision of determination 
of X and particularly a(m) may not be important. 

4. THE D-BUFFER MODEL 

The time until a reservation is made at D's buffer is obviously affected by 
the size of that buffer. Suppose it has capacity B, in packets. Then it is 
reasonable to make the following initial model, the inclusion of which into 
our previous model we term 

A 3 : the long-run or steady-state probability distribution of the contents of 
the D-buffer is 

Hj = K(B)aVj! j = 0,l,...B (4.1) 

where 2^j=o Ttj = 1 determines K(B). In other words, y/e characterize the buffer 

t* 

state as being truncated Poisson, as is true of an M/G/B/fi cjueueing system in 
equilibrium. Of course the parameter a must be estimated, and depends 
upon the rate at which new messages arrive at the particular buffer as well as 
their "holding time" in that buffer. The buffer si,ze, B, i§ a decision variable. 
Now given the above we let 

a(m) = Probability that space for at least m packets exists in the buffer at a 
"random" time 




51 



B-m 

= Z ( 4 . 2 ) 

j=o 

with TCj being given by (4.1). Furthermore, a(m) will be used as before, i.e., as 
the success probability in a Bernoulli trial (coin-flip) scheme. 



5. THE TIME TO COMPLETE MESSAGE TRANSMISSION 



Having made a reservation at D's buffer with the arrival of the first 
packet to find > m packet slots available we now wish to compute (an 
approximation to) the expected time to fill the remaining m-1 slots. It may be 
seen that the number Nj of duplicate packets outstanding for packet j is 
approximately Poisson with mean 1/(X6); simply view the network transit 
time, here taken to be exp(X,), as a service time in an M/M/«» queueing process 
with arrival rate 6, and consider the steady-state situation. Now let Sj be the 
time until outstanding packet j enters the buffer. Then if we use the Poisson 
approximation for re-transmits (See Appendix B) we find 



i* 



P{Sj > s) 



= E 



-XsNj -(s/5)[l-(l-e-^)/Xs] 
e e 



-s/5 

= e , s > o , 



where we have used the fact that E 



I" -XsNjj -l/X5[l-e^] 



(5.1) 



Now assuming 



that all subsequent transits, Sj, j = 1, 2, ... m-1 are independently and 
exponentially distributed according to (5.1) it turns out that the expected time 
to complete filling the m— 1 reserved slots is distributed as 

m-1 



p(T( 2) = max Sj < s} = (l-e"®/^) 



(5.2) 



52 



Consequently 



E[T(2)]=E[MaxSj]=6[l 




J~ 5 In(m-l). 



(5.3) 



Thus the total time to transfer an m-packet message is = T(l) + T(2), with 
expectation 



6. OPTIMIZING THE MESSAGE SIZE 

If a large number of packets, i.e., making up a data base, is to be 
transferred in messages of size m then renewal theory, see Ross (1985), can be 
invoked to show that the long-run rate of transfer of data packets is of the 
form 



where c is the time to acknowledge a successful transfer. In words, a reward 
of m packets is received every time a cycle of length E[Tm] + c completes. It is 
intuitiyely apparent that an optimal value of m will often exist. It can be 
found numerically, as indicated in the following simple and illustrative (but 
doubtless unrealistic) example. 

In this example a message size of 32 packets maximizes the transfer rate. 
A larger size is apparently much less effective because of the difficulty of 
reserving space in D’s buffer. Presumably a smaller size is less effective 
because acknowledgements consume too much time. 



E[Tm] = E[T(1)] + E[T(2)]. 



(5.4) 




( 6 . 1 ) 



53 



EXAMPLE 



a=25, B = 50 



p=l=X,5 = ^,c = l 



m 

(# packets in 
message) 


r(m) 
(Rate of 
transfer) 


E[T(1)1 


E[T(2)1 


2 


0.94 


0.63 


0.50 


8 


3.07 


0.31 


1.30 


14 


4.95 


0.24 


1.59 


20 


6.70 


0.21 


1.77 


26 


8.22 


0.25 


1.91 


32 


9.06 


0.52 


2.01 


38 


6.70 


2.56 


2.10 



Figure 1 displays the packet transmission rate as a function of the number 

of packets in a message for two values of the retransmission time 5. The 

other parameters are X = 0.5, B = 50, c = 1, a = 0.5. Note that the transmission 

rate for 5 = 0.75 is always less than that for 5 = 0.5. This behavior is due to the 

fact that the model for D's buffer is not affected by the value of 5. However, 

since all sources would be following the same retransmission policy a 
»» 

decrease in 5 may increase the congestion at D's buffer. 

Figure 2 displays the packet transmission rate for a model in which D's 
buffer is affected by 5. D's buffer is still modeled by an M/M/B/B queue. 
However, the parameter of the limiting distribution of the queue is a = 12.5/5 
rather than a = 25 as in Figure 1. The parameter 5 takes 3 values, 5 = 0.4, 
5 = 0.5 and 5 = 0.7. The other parameters are as in Figure 1. Note that 5 = 0.5 
has the largest maximizing transmission rate of the three. If 5 is too large, too 
much time is spent waiting to recover from a last packet. If 5 is too small, D's 
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buffer is more likely to be full and packets from S will be lost more frequently. 
The parameterization of a is ad-hoc for this model. Future models will 
address the effect of other sources' retransmissions on the environment a 
tagged packet is subjected to. 

7. CONCLUSIONS 

We have presented an initial model that illustrates the interplay of 
various system factors influencing choice of optimal message size in data 
transfer. This study is only a beginning, and must be expanded and improved 
in order to be truly useful. Nevertheless it does capture some features of the 
real situation. 

Examination of the table suggests that there is an advantage to increasing 
the message size m initially; this hastens the occurrence of a reservation, i.e., 
shortens E[T(1)]. However, too large an m quickly makes reservations in the 
D-buffer extremely difficult and extends the time E[T(1)] greatly. Naturally, 
beneficial changes in both 6 and B can influence the transfer rate as well; an 
overall optimization attempt should be placed in the agenda. Finally the 
effect 6f changing the retransmission time 5 on the environment of other 
packets that a tagged packet encounters is another subject of future study. 
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APPENDIX B. POISSON APPROXIMATION TO THE DISTRIBUTION OF 
THE ARRIVAL TIME TO THE BUFFER BY AN OUTSTANDING PACKET 
AFTER A MESSAGE RESERVATION HAS BEEN MADE 

Suppose a packet of a message has just made a reservation at D’s buffer. 
We are interested in an approximation to the distribution of the additional 
time required for the outstanding packet of the message to arrive at D. We 
will assume the network transit times for packets are iid exponential with 
mean 

We will approximate the process of retransmission of packet j by a 
Poisson process with rate ^independent of the other packets. Although 

packets are retransmitted at constant intervals, 6', this approximation may be 
justified if many of the packets are lost or damaged, in this case 5 > 5 '. 

Let Nj be the number of outstanding duplicates for packet j at the time 

the reservation is made. We will approximate the distribution of Nj by the 
limiting distribution of an M/M/oo queue with arrival rate ^ and mean 
servicei>time^. This approximation leads to Nj having a Poisson distribution 

with mean (X5)~L 

Let Uj be the smallest arrival time at D of the additional duplicate packets 
sent after the reservation is made. 
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1 

—s 



p{Uj > s} = f]exp|--is|L^^ “1 exp{-A(s-u)}i/« 

Let Sj be the time of arrival at D of the first of the duplicate packets for 
outstanding packet j. 

p{Sj > s} = E jp{Uj > s} 

= exp{-^[l - .->^l}exp{-i[As - (l - 



= expi--s 
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LONG RUN TRANSFER RATE OF PKTS;M/M/B/B MODEL OF D 
L=.5;BUFFER=50;C=1;ALPHA=25;DELTA=.5 .75 




3ivy NOissiiNSNvyi 
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MSG SZ IN PKTS 

Figure 1 
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MSG SZ IN PKTS 

Figure 2 
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